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5 METHOD AND APPARATUS FOR DETECTING PREDETERMINED 

MOLECULAR STRUCTURES IN A SAMPLE 

Field of the Invention 

]0 The present invention relates to methods and 

system for molecular detection. 

Background of the Invention 

15 An increased effort has been directed toward 

the development of chips for molecular detection. 
Typically, a molecular detection chip includes a 
substrate on which an array of binding sites is 
arranged. Each binding site, or hybridization 

20 site, has a respective molecular receptor which 

binds or hybridizes with a molecule having a 
predetermined structure. 

A sample solution is applied to the molecular 
detection chip, and molecules in the sample bind or 

25 hybridize at one or more of the binding sites. The 

particular binding sites at which hybridization 
occurs are detected, and one or more molecular 
structures within the sample are subsequently 
deduced. 

30 Of great interest are molecular detection 

chips for gene sequencing . These chips , often 
referred to as DNA chips, utilize an array of 
selective binding sites each having respective 
single-stranded DNA probes. A sample of single- 

35 stranded DNA fragments, referred to as target DNA, 

is applied to the DNA chip. The DNA fragments 
attach to one or more of the DNA probes by a 
hybridization process. By detecting which DNA 
probes have a DNA fragment hybridized thereto, a 
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FIG. 6 illustrates a reference pattern for 
detecting an a-c-t nucleotide sequence in a sample; 

FIG. 7 is an example of a pattern generated by 
a sample having an unknown molecular structure; 

FIG. 8 is a flow chart of additional "steps 
which can be utilized to detect the predetermined 
molecular structure; 

FIG. 9 is a block diagram of an apparatus for 
detecting a predetermined molecular structure in a 
sample; and 

FIG. 10 is a flow chart of an embodiment of a 
method of gene discovery in accordance with the 
present invention. 

Detailed Description of a Preferred Embodiment 



Embodiments of the present invention 
advantageously provide improved information 
processing approaches to detecting predetermined 
molecular structures using a miniaturized device 
25 having an array of biological sensors. Just as 

semiconductor devices are designed to perform 
specific functions, a diagnostic device in 
accordance with the present invention is designed 
to perform one or more specific diagnostic tests. 
30 FIG. 1 is a flow chart of an embodiment of a 

method of detecting a predetermined molecular 
structure in a sample. In general, the method can 
be utilized for detection of a variety of molecular 
structures in a variety of different types of 
35 samples. Examples of the different types of 

samples include, but are not limited to, medical 
samples, environmental samples, agricultural 
samples, and other samples applicable tc 
diagnostics . 
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5 hybridizes with a molecule having a predetermined 

structure . 

Each molecular receptor typically includes a 
biological or synthetic molecule having a specific 
affinity to the molecule to be detected. Of 

10 particular interest is a molecular receptor having 

a chain of at least one nucleotide to hybridize 
with a molecule having a complementary chain of at 
least one nucleotide. Here, for example, the 
molecular receptor can include a DNA probe for 

15 detecting a corresponding, complementary DNA 

sequence in the sample. 

It is noted, however, that the scope of the 
invention is not limited to sensing the 
hybridization of DNA molecules. For example, 

20 embodiments of the present invention can be 

utilized to detect RNA hybridization and antibody- 
antigen binding events. As another alternative, 
the molecular detection device can include an array 
of detection sites, such as in the context of an 

25 oligonucleotide ligation assay (OLA) . Using a 

ligase chain reaction, pairs of oligonucleotides 
are utilized to amplify a selected oligonucleotide 
sequence. To detect the selected oligonucleotide 
sequence, a corresponding detection site is 

30 screened for full-length ligated oligonucleotides 

using any of the sensing approaches described 
herein . 

As indicated by block 12, an optional step of 
tagging molecules within the sample is performed. 
35 Each molecule is tagged with a member which can be 

sensed by the molecular detection device. Such 
members are commonly referred to in the art as 
tags, markers, and labels. Examples of such 
members include, but are not limited to, 
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5 After hybridization, an optional step of 

removing unwanted molecules from the binding sites 
can be performed, as indicated by block 18. The 
step of removing unwanted molecules can be 
performed by generating an electric field 'having 

10 the same polarity as the charge of the unwanted 

molecules. The electric field acts to repel 
unwanted molecules from the binding sites. As an 
alternative to, or in conjunction with, the field- 
based approach, a thermally-assisted approach can 

15 t>e utilized to remove unwanted molecules. Here, 

the temperature at the binding sites is raised, in 
dependence upon a melting temperature, to 
dissociate partially-bound molecules from the 
molecular receptors. Regardless of the approach 

20 utilized, the unwanted molecules to be dehybridized 

can include unbound molecules and partially-bound 
(i.e. non-specif ically bound) molecules . 

Typically, the step of removing unwanted 
molecules does not remove all unwanted molecules 

25 from the binding sites. This step is beneficial, 

however, in improving the accuracy of detection as 
outlined in subsequent steps. 

As indicated by block 20, the method includes 
a step of sensing a pattern in which the sample 

30 binds to an array of binding sites in a molecular 

detection device. The pattern can be sensed using 
a variety of approaches, including but not limited 
to, optical approaches, radioactive-sensing 
approaches, electronic approaches, and magnetic 

35 approaches. The specific approach utilized depends 

upon the type of tagging member attached to the 
molecules in the sample. 

Preferably, the step of sensing the pattern 
includes sensing an intensity or a magnitude of 



determining a difference between the pattern and 
the reference pattern. Here, the predetermined 
molecular structure can be detected when a measure 
of the difference is within a predetermined range. 

For optical sensing embodiments, the step of 
comparing includes a step of comparing at least one 
image of the pattern to an image of the reference 
pattern . 

As indicated by block 24 , the method 
optionally comprises a step of determining a 
confidence level of detecting the predetermined 
molecular structure in the sample. The confidence 
level indicates a degree of significance of the 
result obtained in the step of comparing in block 
22. 

To screen the sample for a plurality of 
different molecular structures, the steps indicated 
by blocks 22 and 24 can be repeated for a plurality 
of different reference patterns. Here, for 
example, a genomic sample can be screened to 
determine if it contains any of a plurality of 
predetermined base sequences. 

FIG. 2 is a flow chart of an embodiment of a 
method of generating the reference pattern. 
Typically, the reference pattern is generated prior 
to performing the steps indicated in FIG. 1. 

As indicated by block 30, the method includes 
a step of providing a reference device having a 
like array of binding sites as the molecular 
detection device used for detection. If desired, 
the same molecular detection device can be utilized 
for generating the reference pattern and for 
subsequent detection of an unknown molecular 
structure in a sample . Typically, however, another 
like device is utilized. 
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5 molecular structure. Alternatively, the steps 

indicated by blocks 30, 32, 34, 36, 38, 40, and 42 
can be repeated to apply the same predetermined 
molecular structure to a number of like reference 
devices. Either approach may be utilized to 

!0 provide a plurality of reference patterns for the 

same predetermined molecular structure. 

Sensed patterns formed by a sample having an 
unknown molecular structure can be compared to each 
of the above-described plurality of reference 

15 patterns, or a statistical model thereof, to detect 

the predetermined molecular structure in the 
sample . 

FIG. 3 is a flow chart of another embodiment 
of a method of generating the reference pattern. 

20 As" indicated by block 50, the method includes a 

step of determining an architecture of the array of 
binding sites of the molecular detection device. 
This step can include determining a layout of the 
binding sites, and determining the type of 

25 molecular receptor at each of the binding sites. 

As indicated by block 52, the method includes 
a step of predicting a reference pattern in which 
the predetermined molecular structure binds to the 
array of binding sites. The reference pattern is 

30 predicted based upon the predetermined molecular 

structure and the architecture of the molecular 
detection device. Preferably, the reference 
pattern includes a predicted intensity of binding 
at each of the binding sites. 

35 Regardless of the approach taken, the 

reference pattern acts as a novelty filter which is 
predictive of a successful or a desirable test 
result . 



■ 



as binding sites indicated by reference numeral 64, 
have a lesser intensity of binding. Those binding 
sites having two mismatching complementary bases, 
such as those indicated by reference numeral 66, 
have an even lesser intensity of binding. "Binding 
sites with no matching complementary bases, such as 
those indicated by reference numeral 68, have a low 
intensity of binding. 

FIG. 6 illustrates a reference pattern for 
detecting an a-c-t nucleotide sequence in a sample. 
The reference patterns in FIGS. 5 and 6 can be 
sensed using a reference sample, or can be 
predicted based on the number of mismatching bases 
at each binding site. 

For purposes of illustration, the sequences in 
this example are assumed to have a specific 
orientation. As a result, an a-c-t sequence and a 
t-c-a sequence do not specifically hybridize at the 
same binding site. It is noted, however, that this 
should not be construed as a limitation in the 
scope of the present invention. 

FIG. 7 is an example of a pattern generated by 
a sample having an unknown molecular structure. 
The pattern is generated by applying a sample of 
tagged single-stranded DNA molecules to the 
molecular detection device, allowing the molecules 
to hybridize to the binding sites, and optionally 
removing unwanted molecules. 

The resulting pattern shows a high intensity 
of binding at a t-g-a site 70. If standard 
detection techniques were utilized, one would 
conclude that the sample includes an a-c-t 
nucleotide sequence (i.e. the complement of t-g-a). 
However, the overall pattern is better correlated 
to the reference pattern for the a-c-g nucleotide 
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5 selected binding sites, or can be modified for all 

of the binding sites. 

Thereafter, a step of sensing a pattern, 
indicated by block 82, is performed. By repeatedly 
raising the temperature and sensing a resulting 

10 pattern, a plurality of temperature-dependent 

patterns is generated. 

As indicated by block 84, a step of comparing 
at least one of the plurality of temperature- 
dependent patterns to a corresponding at least one 

15 of a plurality of temperature-dependent reference 

patterns is performed. Here, for example, each of 
the temperature-dependent patterns can be compared 
to a corresponding one of the temperature-dependent 
reference patterns. Alternatively, only selected 

20 ones of the temperature-dependent patterns can be 

compared to corresponding reference patterns. A 
correlation measure and/or a difference measure is 
computed based on this comparison. A predetermined 
molecular structure is detected when the measure is 

25 within a predetermined range. 

In one embodiment, the temperature-dependent 
pattern having a greatest variability of intensity 
is selected for comparison. The variability of 
intensity is greatest at a temperature which 

30 dissociates many non-specifically-bound molecules, 

but does not significantly dissociate specifically- 
bound molecules. This pattern can be compared with 
a corresponding reference pattern to detect a 
predetermined molecular structure. It is noted 

35 t hat a variety of different measures of variability 

can be utilized, including but not limited to, 
sample variance and sample standard deviation. 

FIG. 9 is a block diagram of an apparatus for 
detecting a predetermined molecular structure in a 
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5 present invention. As indicated by block 100, the 

method includes a step of sensing a pattern of 
detection for a sample applied to a molecular 
detection device having a plurality of detection 
sites. The sample is taken from a first species 

10 having unknown genes to be discovered. Any of the 

various approaches described herein can be utilized 
for sensing the pattern. 

As indicated by block 12, the method includes 
a step of determining an architecture of the 

15 plurality of detection sites of the molecular 

detection device. 

A step of reading a nucleotide sequence from a 
database is performed, as indicated by block 104. 
In general, any nucleotide sequence can be read. 

20 Of -particular interest, however, is a nucleotide 

sequence from a second species other than the first 
species. For example, the nucleotide sequence can 
include a gene from a fruit fly, -while the sample 
in which gene discovery is to be performed is from 

25 a human. 

As indicated by block 106, the method includes 
a step of predicting a reference pattern which 
would be detected if the nucleotide sequence were 
applied to the molecular detection device. As 

30 indicated by block 108, a step of comparing the 

pattern to the reference pattern is performed to 
determine whether the nucleotide sequence is within 
the sample. 

The steps of reading a nucleotide sequence 

35 from the database, predicting a reference pattern 

for the nucleotide sequence, and comparing the 
pattern to the reference pattern are repeated to 
discover the presence of genes across different 
species . 
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5 Claims 

1. A method of detecting a predetermined 
molecular structure in a sample, the method 
comprising the steps of: 

10 sensing a pattern in which the sample binds to 

an array of binding sites in a molecular detection 
device; and 

comparing the pattern to a reference pattern 
to detect the predetermined molecular structure in 
15 the sample. 

2. The method of claim 1 further comprising 
the step of determining a confidence level of 
detecting the predetermined molecular structure in 

20 the^ sample . 

3. The method of claim 1 wherein the step of 
comparing includes at least one of determining a 
difference between the pattern and the reference 

25 pattern, determining a correlation between the 

pattern and the reference pattern, comparing at 
least one image of the pattern to an image of the 
reference pattern, comparing a plurality of 
temperature-dependent patterns to a plurality of 

30 temperature-dependent reference patterns. 

4. The method of claim 1 wherein an intensity 
level of at least one of the binding sites is 
indicative of at least one of molecules bound at a 

35 respective binding site, a binding strength at a 

respective binding site, and a melting temperature 
at a respective binding site. 
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array cf binding sites or by predicting a pattern 
in which the predetermined molecular structure 
binds to the array of binding sites. 
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